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Abstract 

In this project, students use a real-world, complex database and experience firsthand the consequences 
of inadequate data modeling. The U.S. Environmental Protection Agency created the database as part 
of a multimillion dollar data collection effort undertaken in order to set limits on air pollutants from 
electric power plants. First, students explore the database to identify design limitations from the 
perspective of a data analyst with a specific goal. Second, students create a new database design which 
overcomes identified problems. Through this case study, students develop the skill to infer usage 
implications by studying the design of an existing database. This is important since developers often 
inherit databases designed by others. Students also learn how to prepare data stored in a relational 
database for a data analysis project. By experiencing the consequences of an inadequate design from 
a user perspective, students can better appreciate the importance of relational database design 
principles and become more committed to using them. 
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1. INTRODUCTION 

John and Kayla had just started their new jobs as 
Data Analysts at the Utility Research Institute 
(URI), a non-profit organization that conducts 
research on behalf of its funding organizations -- 
primarily electric utility companies operating 
within the United States. John had an M.S. in 
Computer Science and had worked as a Database 
Management Administrator for the past five 
years. Kayla had just graduated with an M.S. in 
Mathematics with a concentration in Statistics. 
They had been assigned to work together on a 
project analyzing data that was compiled by the 
U.S. Environmental Protection Agency (EPA). The 
data was collected as part of a process for 


establishing the first ever national standards 
limiting emissions of hazardous air pollutants 
such as mercury from coal and oil fired power 
plants. The EPA had made the data available to 
the public in the form of a Microsoft Access 
database and the Institute wanted to use this 
data to determine boiler features and pollution 
control equipment that would satisfy emission 
standards for all of the newly regulated 
pollutants. 

2. BACKGROUND 

To kick off the project, Kayla and John's manager, 
Ravi, briefed them on the regulatory history of air 
pollutants within the U.S. utility industry. He said 
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that the recent 2011 ruling, known as the Mercury 
and Air Toxics Rule (MATR), imposed the first 
ever national limits on heavy metals such as 
mercury and acid gas emissions from coal and oil 
power plants (EPA 2011b). The rule specifically 
limited air emissions of mercury, filterable 
particulate matter and hydrochloric acid from coal 
and oil fired plants with at least 25 megawatt 
hours of generating capacity. The EPA had 
decided to use filterable particulate matter and 
hydrochloric acid as surrogates for all non¬ 
mercury metals and all acid gases respectively. 
To show the importance of the project, Ravi 
shared an article from a trade journal (Neville 
2012) in which industry representatives described 
MATR as the most expensive regulation under the 
Clean Air Act (CAA) in terms of direct costs and 
the most extensive intervention into the power 
market that the EPA had ever attempted. EPA's 
own detailed analysis estimated that the rule 
would affect about 500 coal-fired plants and 100 
oil fired plants at an annual cost of $9.6 billion 
(EPA 2011a). Given the significant compliance 
costs, it was likely that some utilities would be 
making "invest or retire" decisions for many 
plants -- especially older ones. 

John asked Ravi why the electric utilities hadn't 
been subject to earlier regulation of these air 
pollutants. Ravi explained that while electric 
utilities were no stranger to regulation under the 
CAA, they had been treated differently than other 
industries in the major 1990 amendments (EPA 
2013a). Congress passed these major revisions 
to better control urban air pollution (Title I), 
pollutants from mobile sources (Title II), toxic air 
emissions (Title III), acid rain (Title IV) and 
ozone-depleting chemicals (Title VI). Title V 
delegated responsibility for regulatory oversight 
to individual states via a permitting process. Title 
IV had imposed significant regulations on the 
utility industry to better control emissions of 
sulfur dioxide which contributes to acid rain. Title 
I had imposed limits on emissions of nitrous 
oxides and particulate matter which contribute to 
urban area smog and also impacted the utility 
industry. 

However, Ravi explained that the electric utility 
industry had successfully forestalled regulation 
under Title III of the amendments (e.g. Lemonick 
1990). Title III listed 189 air toxins for which the 
EPA was required to identify source categories 
that would be subject to future regulation under 
section 112 of the CAA. Standards under section 
112 were based on what was referred to as 
maximum achievable control technology (MACT). 
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For existing sources, MACT sets a minimum level 
of stringency called the floor which is the average 
emission "achieved by the best performing 
twelve percent of existing sources in the category 
or the best performing five sources for source 
categories with less than thirty sources" (EPA 
2013b). Quoting the CAA, congress had required 
the EPA to perform a study of the "hazards to 
public health reasonably anticipated to occur" as 
a result of emissions of listed air toxins and to 
regulate electric utilities underTitle III only" if the 
Administrator finds such regulation is appropriate 
and necessary after considering the results of the 
study" (EPA 2013c). A general report regarding 
all the listed air toxins by utilities was due in three 
years and an additional report addressing health 
effects of mercury emissions from utilities and 
other industries was due in four years. 

Kayla asked why congress had given utility 
companies a reprieve; it didn't seem to make 
sense if they were significant sources of the listed 
air toxins. Ravi surmised that congress may have 
been more lenient with utility companies under 
Title III since they were already primary targets 
of regulation under Titles I and IV of the 1990 
amendments. Both John and Kayla were 
surprised that emissions of heavy metals such as 
mercury, arsenic and lead had never been 
regulated within the utility industry. Noting that 
some individual states did limit power plant 
emissions of heavy metals such as mercury, Ravi 
agreed that it was surprising that so many air 
toxins from power plants had not been regulated 
at the federal level - at least until now. 

Mercury, in particular, had received significant 
attention (e.g. EPA 1997, Center for Disease 
Control 1999, Physicians for Social Responsibility 
2004). As explained in the 1997 EPA report, 
mercury released by industrial sources into the air 
can circulate in the atmosphere for up to a year 
and can be deposited on land and water 
thousands of miles from the original source. 
When heavy metal mercury is consumed by living 
organisms, it is converted to bioaccumulative 
methyl-mercury which becomes more 
concentrated in organisms higher in the food 
chain. A fact sheet issued by the Physicians for 
Social Responsibility (2004) describes mercury as 
a "potent neurotoxin" that affects the functioning 
of the central nervous system and explains that 
most Americans are exposed to mercury through 
the consumption of fish - especially of higher food 
chain predatory fish like swordfish and tuna. In 
its 1997 report, the EPA had estimated annual 
emissions of mercury within the U.S. to be about 
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158 tons of which 87% came from waste and 
fossil fuel combustion. However, since waste 
combustion had been subject to earlier 
regulation, fossil fuel combustion (primarily coal) 
was now the dominant source of mercury 
emissions in the United States. Ravi summed up 
the discussion by stating that two decades after 
the 1990 amendments, the 2011 MATR had listed 
electric utilities as a source category under 
section 112 of the CAA and that the long delay 
was a result of years of litigation between 
industry, non-governmental organizations, states 
and the EPA. 

3. RESEARCH PURPOSE 

The discussion then switched to the purpose of 
the research and the EPA data. Ravi explained 
that in order to gather the data needed to set the 
standards, the EPA had issued a two-phase 
information collection request in 2009 (EPA 
2009). In the first phase, electric generating 
units (EGUs) subject to the new regulation 
completed a twenty-five page paper survey 
providing the most recent twelve months of 
emissions test and fuel analysis data since 2005 
as well as data about plant equipment (e.g. boiler 
characteristics, pollution controls) and permitting 
requirements. In the second phase, the EPA 
selected EGUs who were believed to be the best 
performing units within specified pollutant 
categories. These EGUs were required to conduct 
emissions stack testing to measure flue gas 
entering the atmosphere and to conduct analyses 
of fuel used during testing. The cost of data 
collection and quality assurance was estimated to 
be about $10 million and the cost of stack and 
fuel testing was estimated to be about $66 million 
(EPA 2009). In order to leverage this investment, 
the Institute wanted to gain as much knowledge 
as possible from the EPA data which was made 
available to the public in the form of two MS 
Access databases -- one for each collection 
phase. They would start with the data from the 
first collection phase. Ravi was sure that this task 
alone would be very challenging. After they had 
mastered the Phase I database, they would 
consider integration of Phase II data. Links to the 
original data and descriptive information are 
provided in Table 1. 

The purpose of the current project is to determine 
which combinations of equipment provide the 
best overall control of multiple pollutants. Certain 
boilers can remove pollutants during combustion 
or while coal is being burned. For example, 
fluidized bed boilers float and tumble burning coal 
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on upward jets of air. The tumbling allows solids 
such as limestone to be mixed in with the coal 
and absorb pollutants such as sulfur dioxide. 
Other boilers are designed to burn coal at lower 
temperatures which inhibit the formation of 
nitrous oxides. In addition, different types of post 
combustion controls can remove pollutants from 
the flue gas before it is released into the air 
through the smokestack. 

Kayla had one nagging question: What was the 
value of analyzing equipment that wasn't 
intended to control emissions of the newly 
regulated pollutants? Ravi explained that the EPA 
(2011a) had argued that the new standards were 
based on "existing, commercially proven 
technologies that are...frequently used in this 
industry such as electrostatic precipitators, fabric 
filters (bag houses), flue gas desulfurization 
(scrubbers) or dry sorbent injection." In other 
words, equipment used to control sulfur dioxide, 
nitrous oxides, particulate matter also controlled 
emissions of the newly regulated pollutants - at 
least according to the EPA. Indeed as Ravi 
pointed out, the EPA was using particulate matter 
as a proxy for all non-mercury metals. "So does 
this mean, the newly regulated pollutants were - 
in effect -- already being regulated" Kalya asked? 
Ravi wasn't so sure stating that "these are the 
kinds of questions we need to answer with our 
research" and that "controls for different 
pollutants may interact in ways that do not 
simultaneously reduce all regulated pollutants". 

4. A DATA NARRATIVE 

John had spent the last week studying the Phase 
I EPA database and was meeting with Kayla to 
give her an overview of what he had learned so 
far. He also wanted to get a better understanding 
of what data and what format would be required 
to conduct statistical analyses. Referring to the 
EPA database diagram, John convinced Kayla that 
the EPA Phase I was complex involving many 
dimensions. It contained forty different tables 
which were linked together by almost as many 
relationships. He showed her a sketch (Figure 1) 
of the data entity relationships which he had 
created based on the EPA database diagram. 

In order to get a better understanding of the 
content of the database, John explained to Kayla 
that he had created a smaller "test" M.S. Access 
database by deleting some of the tables and fields 
from the first phase EPA database. He believed 
that the smaller database contained the most 
important data for their research project and that 
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the simplification would facilitate their preliminary 
analysis. All relationships were those created by 
the EPA and no records had been deleted from the 
remaining tables. A screen shot of MS Access 
relationships in the test database is shown in 
Figure 2. 

John had many questions but would do his best 
to explain the Figure 2 diagram to Kayla. A 
facility, described in the facility_information 
table, is all the property, plant and equipment 
that resides at single geographic location and that 
has a legal owner. A configuration is a set of 
equipment components ordered by their physical 
location within the electricity generation process. 
A facility can have multiple configurations which 
are possibly operated concurrently at a given 
point in time or possibly which have changed over 
time due to the addition, modification or removal 
of particular equipment. Configurations are 
described in the configuration_components 
table. 

Each configuration starts with one or more of 
what was labeled as a "unit". Each unit is in turn 
mapped to one or more boilers in the 
unit boilers table and boilers are described in 
the boiler_ information table. John knew that 
the information in the boiler_information table 
would be important but he did not know what a 
"unit" represented. It seemed that the label 
"unit" was so generic that it could represent any 
kind of equipment. Question 15 of the EPA survey 
required "identification (or designation) of all 
coal- and oil-fired steam generating units 
(boilers) (as defined by Clean Air Act section 
112(a)(8)) located at this facility" The question 
parenthetically indicates that a steam generating 
unit is a "boiler" and a footnote indicates that 
either a Boiler id or a Generator id can be 
provided: 

Boiler ID as reported on U.S. DOE/EIA 
Form EIA-860 (2007), "Annual Electric 
Generator Report", schedule 6, part A, 
line 1, page 53 OR on schedule 6, part B, 
line 1, page 54 OR Generator ID as 
reported on "U.S. DOE/EIA From EIA- 
923 (2008)" 

John wondered whether allowing the 
interchangeable use of boiler and generator ids 
was a source of design problems. According to 
the language of the CAA, the EPA is required to 
regulate steam generating units and the CAA 
defines an "electric utility steam generating unit" 
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as "any fossil fuel fired combustion unit of more 
than 25 megawatts that serves a generator that 
produces electricity for sale". Based on this 
definition, the steam-generating unit is not the 
same as the generator that produces electricity. 
The former generates steam and the latter 
generates electricity. Like the term "unit", 
"generate" also had multiple meanings. Further 
adding to the confusion, the term "steam" was 
often use to describe a generator as indicated on 
EIA (Energy Information Administration) Form 
860: 

Enter the identification (ID) code for 
each boiler that provides steam to each 
combustible-fuel steam generator ... and 
for each combined cycle steam turbine 
generator. Boilers may be associated 
with multiple generators. 

It was also apparent that there is a many-to many 
relationship between boilers and generators. In 
order to clarify the terminology, John conducted 
some research and settled on the following 
definitions: 

A boiler is a vessel which burns fuel to 
boil water and create expanding, 
pressurized steam which is transferred to 
at least one turbine. The thermal energy 
will be converted into rotating kinetic 
energy. 

A turbine is a rotor with blades that is 
connected to the shaft of a generator. It 
uses rotary motion to convert kinetic to 
mechanical energy. 

A generator is copper wire coiled around 
a shaft that is surrounded by a giant 
magnet. When the shaft is rotated, 
electric current is created on the wire, 
converting mechanical energy to 
electrical energy. 

Fie was confident that these definitions provided 
much needed semantic clarity. And he had also 
discovered that the qualifier "steam" was used to 
distinguish the type of turbine which in addition 
to steam included water, wind and gas types. 

Based on the survey instructions, unit id was 
possibly meant to refer to a generator but he was 
still unsure. In the entire phase I database, there 
was NO additional information stored about units 
beyond the id itself. Fie was puzzled why the 
configuration table included units and not boilers. 
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CAA regulatory rules apply to boilers and not 
generators. This issue required further research; 
he had a nagging concern that it would be a cause 
of problems for their research. 

Each configuration also has at least one chimney 
- called a stack - where gas exits the process. 
One or more pollution control devices may be 
installed after the unit and before a stack. The 
database contained four major groups of such 
post-combustion controls devices including 
particulate matter (PM) controls, nitrous oxide 
(NOx) controls, sulfur dioxide (S02) and other 
controls. The "other" category contained mercury 
(Hg) control devices and Kayla and John agreed 
these would need to be separated. Control 
devices which are relatively independent (e.g. 
can be removed and relocated within a 
configuration or installed within another 
configuration) are referred to as "facility" controls 
and are described in the facility^controls 
table. In addition, boilers have design features to 
control NOx pollution which are described in the 
boiler nox control table. Air is sampled 
through ducts called sampling ports which can be 
placed at different locations within the process as 
well as at the exhaust stack. John noted that only 
pollution controls that were located upstream of 
(e.g. before) the sampling location should be 
associated with pollutant measurements at that 
location. 

In the survey, utilities provided historical 
emissions data in the form of test reports. Each 
test report often corresponded to a compliance 
reporting requirement and each report in turn 
consists of multiple sampling runs where 
measurement devices collect and analyze 
samples of air during a discrete period of time. 
Multiple sampling runs might be used to ensure 
that measurements reflect steady state 
conditions of the electricity generation process. 
Each sampling run is in turn associated with one 
or more pollutants for which emissions are 
reported. The database contained emissions data 
for 106 different pollutants -- although many of 
these were infrequently reported. Kayla and John 
decided to focus only on the following pollutants: 
filterable particulate matter, sulfur dioxide (S02), 
nitrogen oxide (NOx), total mercury (Hgt) and 
hydrogen chloride (HC1). John was initially 
confused about which type of mercury he should 
use but he had verified that total mercury is the 
sum of elemental mercury (HgO), particulate 
bound mercury (Hgp) and oxidized mercury 
(Hg++). So for now, they would extract only Hgt. 
To further complicate matters, emissions were 
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reported using different units of measurements 
including emissions rates (e.g. weight emitted per 
time period), emission factors (weight per heating 
fuel content) and concentrations (parts per air 
volume). These units of measurement are 
interdependent in that one may be derived from 
others given additional data. Kayla had done 
some initial investigation on converting emissions 
to a common unit of measurement and found that 
it was not straightforward. There were multiple 
conversion formulas which each made different 
assumptions and required different additional 
data. So to begin their analysis, Kayla and John 
agreed to use only sampling runs which reported 
emissions factors as pounds per million British 
Thermal Unit (Ib/MMBtu) since this was the most 
frequent unit of measurement in the 
sampling_run_ pollutants table. Emissions 
data is contained in testreports, 
sampling_runs and sampling_run_pollutants 
tables. 

5. THE EXTRACT 

John wanted to know what data format would be 
required for statistical analysis. Kayla explained 
that the typical input for statistical software is a 
two dimensional file or table. Each row 
represents an observation and each column 
represents a variable. John referred to this type 
of input as a flat "denormalized" table. Kayla 
continued explaining that it is usual in statistical 
analyses that some variables are dependent 
(those to be predicted or explained) and others 
are independent (those that form the basis for 
explanation or prediction). Computers scientists 
might more easily understand dependent and 
independent variables as output and input 

variables. In the current project, dependent 

variables are pollutant emissions and 

independent variables are boiler characteristics 
and control equipment. 

They needed to determine the unit of analysis or 
observation and tentatively decided to define the 
observation as a unique combination of boiler 
characteristics and pollution controls at a 
particular facility. They would average pollutant 
emissions to this level of analysis. John 
suspected there might be some situations where 
emissions measurements could not be 

unambiguously associated with unique equipment 
and in these cases the emissions data should be 
excluded from the analysis. It was also important 
that a single emissions measurement was not 
averaged into multiple observations since this 
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would bias results by weighting some 
measurements more heavily than others. Also, 
multiple configurations of identical equipment at 
a specific facility should be merged into a single 
observation. 

The discussion switched to data types - which 
was more straightforward than level of 
aggregation. Kayla suggested coding 

boiler firing type as a categorical data type 
with the following possible values: tangential, 
wall, cyclone, fluidized bed, integrated gas 
combustion cycle (IGCC) and other. Although not 
all statistical procedures handled categorical data 
types, initially she would conduct descriptive 
analyses by boiler firing type. The mapping 
of specific boiler firing type values which exist in 
the EPA database to the extract categories is 
shown equipment classification hierarchy shown 
in Table 3. For example, "front wall", "rear wall", 
"opposed wall", and "other" boiler firing types 
should be mapped to "wall" firing type. 

Since a configuration can have a varying number 
of controls within a single category, Kayla 
suggested coding facility and boiler pollution 
controls as Boolean data types with 1 indicating 
presence of the control and 0 otherwise. Kayla 
and John drafted a preliminary structure for a 
data extract shown in Table 2. Like boiler firing 
types, the equipment classification hierarchy in 
Table 3 maps specific controls to the general 
controls in the extract file. 

6. PRELIMINARY DATA ANALYSIS 

John had some concerns about possible data 
anomalies which would affect the integrity of the 
data used for their research project. Their goal 
was to unambiguously relate emissions 
measurements to boiler characteristics and 
control equipment that was operational at the 
time of the test. He recognized that parts of the 
database did not meet normalization principles 
and some referential integrity constraints were 
missing. He came up with a plan to 
systematically investigate these issues. First, he 
would manually try to create extract records for 
some sample facilities. He had successfully done 
this for facility 663. MS-Access screenshots and 
the extract records for this facility are shown in 
figures 3 and 4 respectively. He had identified five 
additional facilities which he thought might 
present problems and would manually try to 
construct extract data for these facilities. The 
identifiers for the test facilities are: 56, 898, 
1073, 1507 and 2324. For example, a potential 
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problem for facility 1073 is that units 1 and 2 
were each mapped to four boilers (1-4) in the 
unit boilers table. He was concerned that the 
boilers would have different characteristics and 
had begun researching this plant using data at the 
Energy Information Administration web site. He 
had learned that in fact only boilers 1 and 2 
should both be mapped to units 1 and 2. Second, 
in the process of creating extract data for the five 
facilities, he would make a list of problems in 
terms of relating emissions to equipment data. 
Third, he would design a new database to 
overcome any problems and input the data for the 
five facilities as a means of testing the new 
design. He hoped that this would demonstrate 
the viability of reformatting and importing all of 
the EPA data into the new design. He knew that 
the data would be used by the Institute for years 
to come and he was concerned that researchers 
would again and again need to deal with data 
anomalies for each analysis. Undoubtedly, 
assumptions would need to be made to resolve 
certain data ambiguities but at least they would 
be made explicit and uniformly applied to all 
future analyses. 
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Appendix 



Figure 1 - Sketch of Entity Relationships 
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Figure 2 - M.S. Access Relationships in Test Database 
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Survey (see enclosure 1) 
Data dictionary 

Data Diagram 
MS Access Database 
Other Related Links 


www.epa.gov/ttn/atw/utility/gl/eu mact icr part b.pdf 

www.epa.gov/ttn/atw/utility/pro/eu mact icr part-i ii- 

data dictionary.pdf 

www.epa.gov/ttn/atw/utility/pro/eu mact icr part-i ii-db erd.pdf 

www.epa.gov/ttn/atw/utility/eu icr parti partii.mdb 

www.epa.gov/ttn/atw/utilitv/utilitypg.html 


Table 1 - Links to Original Data and Descriptive Information 


Potential Identifiers 

I. Facililty_ID 

2 . Configuration_ID 

3. Boiler_ID 

4. Unit_ID 

5. Sampling_Port_ID 

Boiler characteristics 

6. Boiler_Firing_Type 

7. Boiler_MaxHeatInput 

8. MWe_Capacity 

9. Primary_Fuel 

NOx Boiler Controls 

10. LoNox_Burner 

II. Ovr_Fire (Over air fire) 

12. Other_BoilerNOx 

NOx Facility Controls 

13. SCR (selective catalytic reduction 

14. SNCR (selective noncatalytic 
reduction) 

15. Other_Nox 

Mercury Facility Controls 

16. ACI ( activated carbon injection) 

17. DSI (dry sorbent injection) 


PM Facility Controls 

18. ESP ( Electrostatic precipitator) 

19. PM_Filter 

20. PM_Scrubber 

21. PM_Cyclone 

22. PM_Other (all other PM) 

S02 Facility Controls 

23. Wet_Fgd (Wet Flue Gas 
Desulfurization) 

24. Dry_Fgd (Dry Flue Gas 
Desulfurization) 

Pollutant Emissions 

25. PM_F (PM - Filterable) 

26. S02 (Sulfur Dioxide - S02) 

27. NOx ( Nitrogen Oxide - NOx) 

28. Hgt (Total Mercury Hgt) 

29. HC1 - (Hydrogen Chloride HC1) 


Table 2 - Structure of Extract 
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Boiler Firing Types 

1. Tangential Firing 

2. Wall Firing 

2.1. Front Wall Firing 

2.2. Rear Wall Firing 

2.3. Opposed Wall Firing 

2.4. Other Wall Firing 

3. Cyclone Firing 

4. Fluidized Bed Firing 

5. Stoker Firing 

5.1. Stoker Underfeed 

5.2. Stoker Overfeed 

5.3. S toker Spreader 

5.4. Stoker Other 

6. Integrated Gas Combustion Cycle 
(IGCC) 

7. Other B oiler Firing Type 

Pollution Control Types 

1. Particulate Matter (PM) Controls 

1.1. Electrostatic Precipitator (ESP) 

1.1.1. Cold Side ESP with Flue Gas 
Conditioning 

1.1.2. Cold Side ESP without Flue 
Gas Conditioning 

1.1.3. Hot Side ESP with Flue Gas 
Conditioning 

1.1.4. Hot Side ESP without Flue 
Gas Conditioning 

1.2. PM Filter 

1.2.1. Pulse Filter 

1.2.2. Reverse Air Filter 

1.2.3. S hake and Deflate Filter 

1.3. PM Scrubber 

1.3.1. Syngas 

1.3.2. Wet 

1.3.3. Venturi 

1.4. PM Cyclone 

1.4.1. Multiple Cyclone 

1.4.2. Single Cyclone 

1.5. PM other 


2. Nitrous Oxide (NOx) Controls 

2.1. Facility Nox Controls 

2.1.1. Selective Catalytic Reduction 

2.1.2. Selective Non-Catalytic 
Reduction 

2.1.3. Facility Nox Other 

2.2. Boiler NOx Controls 

2.2.1. Boiler Nox Controls 

2.2.2. Fow NOx Burner 

2.2.3. Overair fire (including 
advanced) 

2.2.4. Boiler NOx Other 

3. Sulfur Dioxide(SQ2) Controls 

3.1. Wet Flue Gas Desulfurization 
(WFGD) 

3.1.1. Wet FGD-Disk 

3.1.2. Wet FGD Flooded Disk 

3.1.3. Wet FGD Jet Bubbling 
Reactor 

3.1.4. Wet FGD Spray 

3.1.5. Wet FGD Tray 

3.1.6. Wet FGD Spray and Tray 

3.1.7. Wet FGD Other 

3.2. Dry Flue Gas Desulfurization 
(DFGD) 

3.2.1. Dry FGD Sorbent Injection 

3.2.2. Dry FGD Spray 

3.2.3. Dry FGD Other 

4. Mercury Controls 

4.1. Activated Carbon Injection 

4.2. Dry Sorbent Injection 

4.3. Other Facility Controls 

4.4. Boiler Controls 


Table 3 - Equipment Classification Hierarchy 
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O' A *) - P- v 


File 


Create 


Microsoft Access 


External Data Database Tools Add-Ins 


I ^ I i5i 

v Q 



3a conflguration_components_qry c=i (H £5 



componentjd 

B-2 

* configi *■ 

001 

ord ▼ component_tYpe 

1 Unit 

facilityJc » control_groi 

663 

control_type 

control_onli * 


PM-HSESP 

001 

2 PM control 

663 PM control 

Electrostatic precipitator, hot side, unspec 

10/1/1987 


NOx-SCR 

001 

3 NOx control 

663 NOx control 

Selective Catalytic Reduction 

5/1/2009 


S02-DS 

001 

4 S02 control 

663 S02 control 

Dry FGD - Unspecified 

5/1/2009 


Stack 

001 

5 Stack 

663 



Record: H 1 of 5 

► m 

No Filter Search 



l 


'Z 


boiler_information_qry 


E) S3 


facility - 

663 

boilei » 

B-2 

boiler_type - bottom_design 

Wall firing - unspecified 

» prir - 

coal 

max » MWe_ * design_ - design_te - hour 

2428 235 1980 1005 ▼ 

Record: H 

1 of 1 

► > > 1 lo Filter Search < 


llll | ► 


boiler_control_nox_qry 


0 

0 

£3 

facility id Q boilc - 

NOx_type 

- 6 - NOx_online - -*■ 

^ B-2 

Overfire Air 

10/1/1981 a 

* 



Record: M 1 of 1 ► 

► • No Filte Search 

i ^L 


ta a 


File 


- - |v 

Home Create 


External Data 


Microsoft Access 

Database Tools Add-Ins 


1 ^ 1 . 1 =!.— 1 


test_reports_qry 



cd S S3 


faci - report ^ - 

rc 

- report_description - co - 

report_be - repc 


663 other 

1 

Annual Compliance Te 001 

8/15/2005 


663 other 

2 

Annual Compliance Te 001 

7/20/2006 


663 other 

3 

Annual Compliance Te 001 

10/17/2007 


663 other 

4 

Annual Compliance Te 001 

6/23/2008 


663 other 

5 

Annual Compliance Te 001 

5/27/2009 



Record: M 4 4 of 5 

► 

►I ► • No Filter Search 

1 4 I llll I B 


^ sampling_run_pollutants_qry cd [U S3 


facilil » rep * 

samp - 

rur - 

pollutant_name 

- emission_factc - 

e 


663 1 

Stack 

avera 

Nitrogen Oxide (NOx) 

0.517 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 1 

Stack 

avera 

PM (Filterable) 

0.0122 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 1 

Stack 

avera 

Sulfur Dioxide (S02) 

1.02 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 2 

Stack 

avera 

Nitrogen Oxide (NOx) 

0.524 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 2 

Stack 

avera 

PM (Filterable) 

0.0179 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 2 

Stack 

avera 

Sulfur Dioxide (S02) 

0.961 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 3 

Stack 

avera 

Nitrogen Oxide (NOx) 

0.463 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 3 

Stack 

avera 

PM (Filterable) 

0.0185 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 3 

Stack 

avera 

Sulfur Dioxide (S02) 

1.087 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 4 

Stack 

avera 

Nitrogen Oxide (NOx) 

0.522 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 4 

Stack 

avera 

PM (Filterable) 

0.0233 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 4 

Stack 

avera 

Sulfur Dioxide (S02) 

0.971 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 5 

Stack 

avera 

Nitrogen Oxide (NOx) 

0.08 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


663 5 

Stack 

avera 

PM (Filterable) 

0.021 pound per mi 

lion British Thermal Unit (Ib/MMBtu) 


_663 5 

Stack 

avera Sulfur Dioxide (SQ2!_ 

1.04 oound Der mi 

lion British Thermal Unit (Ib/MMBtu! v 

Record: M 1 of 15 

► ►! 

► . No Filter Search 

1 i 1 I 

►1 


4 | m 


8a. Unique report Id or name 


| 


Num Lock 


I Ms is 


sot 



Figure 3 - MS Access Screen Shots for Facility 663 
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Extract Records 1 


Potential Identifiers 




1. Facililty ID 

663 

663 

663 

2 . Configuration ID 

1 

1 

la 

3. Boiler ID 

B-2 

B-2 

B-2 

4 . Unit ID 

B-2 

B-2 

B-2 

5. Sampling Port ID 

Stack 

Stack 

Stack 

Boiler characteristics 




6. Boiler Firing Type 

Wall-firing 

Wall-firing 

Wall-firing 

7 . Boiler MaxHeatlnput 

2428 

2428 

2428 

8 . MWe Capacity 

235 

235 

235 

9 . Primary Fuel 

coal 

coal 

coal 

NOx Boiler Controls 




10 . LoNox Burner 

0 

0 

0 

11. Ovr Fire (Over ai r fi re) 

1 

1 

1 

12. Other BoilerNOx 

0 

0 

0 

NOx Facility Controls 




13. SCR (selective catalytic reduction 

0 

0 

1 

14. SNCR (selective noncatalytic reduction) 

0 

0 

0 

15. Other Nox 

0 

0 

0 

Mercury Facility Controls 




16. ACI ( activated carbon injection) 

0 

0 

0 

17. DSI (dry sorbent injection) 

0 

0 

0 

PM Facility Controls 




1. ESP ( Electrostatic precipitator) 

0 

1 

1 

2. PM Filter 

0 

0 

0 

3. PM Scrubber 

0 

0 

0 

4. PM Cyclone 

0 

0 

0 

5. PM Other (all other PM) 

0 

0 

0 

S02 Facility Controls 




6. Wet Fgd (Wet Flue Gas Desulfurization) 

0 

0 

0 

7. Dry Fgd (Dry Flue Gas Desulfurization) 

0 

0 

1 

Pollutant Emissions 2 




8. PM F (PM - Filterable) 

NULL 

0.017975 

0.021 

9. S02 (Sulfur Dioxide-S02) NULL 

1.00975 

1.04 

10. NOx ( Nitrogen Oxide - NOx) 

NULL 

0.5065 

0.08 

11. Hgt (Total Mercury Hgt) 

NULL 

NULL 

NULL 

12. HC1 - (Hydrogen Chloride HC1) 

NULL 

NULL 

NULL 

Applicable Dates 3 

10/81-9/87 

10/87-4/09 

5/09-present 

1) Configurations shown here in columns would be in rows in the actual statistical extract. 

2) The second configuration emissions are averages of first four reports 2005 - 2008. Only test report 5 
emissions from 2009 should be associated with the third configuration. 

3) 10/81 Over Air Fire Control installed. 10/87 ESP installed. 5/09 SCR and FGD installed. 



Figure 4 Sample Extract Records for Facility 663 


©2014 EDSIG (Education Special Interest Group of the AITP) 
www.aitp-edsig.org /www.isedj.org 


Page 15 















